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Abstract — This paper characterizes the capacity of a class of 
modulo additive noise relay channels, in which the relay observes 
a corrupted version of the noise and has a separate channel to 
the destination. The capacity is shown to be strictly below the 
cut-set bound in general and achievable using a quantize-and- 
forward strategy at the relay. This result confirms a conjecture 
by Ahlswede and Han about the capacity of channels with rate 
limited state information at the destination for this particular 
class of channels. 

I. Introduction 

The relay channel is a fundamental building block in 
network information theory. Complete characterization of the 
relay channel capacity would be a first step toward finding 
the capacities of larger networks. Although the capacity of 
the general relay channel is not yet known, the capacities of 
many specific classes of relay channels have been found. These 
special classes include the degraded, reversely degraded [1], 
orthogonal [2], semideterministic [3], and recently a special 
class of deterministic [4] relay channels. All the above relay 
channels for which capacities are characterized have one thing 
in common: they achieve their respective cut-set bounds. This 
makes converses straightforward. Unfortunately it appears that 
the cut-set bound cannot be achieved for many practical 
relay channels. Efforts to find different bounds, or prove the 
looseness of the cut-set bound have proved to be quite difficult. 
Zhang's partial converse [5] demonstrated the latter; Zahedi [2] 
provided some justifications for why the cut-set bound cannot 
be tight in all cases. 

In this paper we find the capacity for a non-trivial class 
of modulo-sum relay channels. In these channels, the relay 
observes a correlated version of the noise between the source 
and the destination, and has a dedicated channel to the 
destination. We show that the capacity can be strictly below 
the cut-set bound, and is achievable by a quantize-and-forward 
strategy [1, Theorem 6]. The quantize-and-forward strategy 
was previously only known to achieve the cut-set bound 
capacity of one class of deterministic relay channels [4]. The 
modulo-sum relay channel appears to be a first example of a 
channel where this strategy achieves a capacity strictly below 
the cut-set bound. 

The quantize-and-forward strategy was designed for use in 
channels where the relay has a poor quality channel from 
the source. In this strategy the relay quantizes its received 
signal, and transmits the quantized signal to the destination. 
The destination first decodes the quantized signal from the 
relay, then uses this signal to help decode the source message. 
The destination may also use its own received signal to help 



the decoding of the quantized signal from the relay, because 
the two signals may be correlated in a general relay channel. 
This technique is known as Wyner-Ziv coding. Quantize-and- 
forward is a natural strategy for the modulo channel considered 
in this paper where the relay observes only the noise. This is 
because there is no message for the relay to decode; all the 
relay can do is to describe the noise to the destination. 

The converse result contained in this paper crucially de- 
pends on two properties of modulo-sum channels. In these 
channels a uniform distribution on the input alphabet achieves 
the maximum possible entropy of the output, regardless of 
the statistics of the additive noise. Further, under a uniform 
input distribution, the output of a modulo-sum channel is also 
independent of the additive noise. This has the consequence 
of simplifying the converse: the side information in Wyner- 
Ziv coding is not useful since the destination's observation is 
independent of the relay's output. 

A relay channel where the relay only gets to observe some 
possibly stochastic function of the noise and has a dedicated 
finite capacity channel to the destination can be viewed as 
a channel with rate limited state information available to the 
destination. The capacity result for modulo-sum relay channels 
coincides with a hypothesis by Ahlswede and Han [6] about 
the capacity of channels with rate limited state information to 
the destination. 

II. A Binary Symmetric Relay Channel 

We begin by deriving the capacity of a particular binary 
symmetric relay channel. The derivation will be directly ap- 
plicable to a broader class of modulo-sum relay channels. The 
simple binary symmetric case is used to distil the essential 
steps and ideas. 

Consider the relay channel as shown in Fig. Q] Here, the 
channel input X goes through a binary symmetric channel 
(BSC) with crossover probability p to reach Y, i.e., Y = X+Z 
(mod 2) with Z being an i.i.d. Ber(p) random variable. The 
relay gets to observe a noisy version of Z, namely Y\ = Z+V, 
where V is an i.i.d. Ber(5) random variable. The relay also 
has a separate BSC to the destination S = X\ + N, where N 
is an i.i.d. Ber(e) random variable. 

Let us define 



Rq = max I{X\] S), 



(1) 



for future reference. If there were no corrupting variable V, 
then the capacity of this channel is as recently characterized 



V ~ Ber(6) N ~ Ber(e) 
2 ~ Ser(p) ^0 U \ Relay | ^© ■fs] 



Fig. 1, A Binary Relay Channel 

in [4] 

C = maxmin{/(X; F) + R , I(X; Y, Yx)}. (2) 

p(x) 

Both hash-and-forward [4], a strategy where the relay 
simply hashes Y\ into equal sized bins, and the classic 
quantize-and-forward are capacity achieving. The multiple 
access cut-set bound is I(X;Y) + Rq. This bound is obtained 
by considering the achievable rate assuming that the relay 
already knew the message the source would like to transmit. 
One way to interpret the achievability of the multiple access 
cut-set bound is that if V were absent, decoding X is 
the same as decoding Z. So, the relay, by sending parity 
information about Z, can be interpreted to be performing 
a version of decode-and-forward, as if it already knows the 
message; random parities for Z turn into random parities for 
X. This interpretation would fail if the relay's observation of 
Z is corrupted by V. To the best of the authors' knowledge, 
the capacity of this class of relay channels when V is present 
has not been characterized previously. 

The following is a reasonable strategy for this channel. 
The relay tries to quantize Y\ in such a way as to minimize 
the uncertainty about Z at the destination. The main result of 
this paper is that the above approach is capacity achieving for 
a class of modulo-sum relay channels including the channel 
in Fig. Q] 

Theorem 1: The capacity C of the binary relay channel in 
Fig. 03 is 

C= max l-H(Z\U) (3) 

p(«|yi):7(;7:Yi)<i?o 

where the maximization may be restricted to C/'s with 
\U\ < \yi\ + 2, and R Q is as defined in (Q). 

A. Proof of Achievability 

Fix the input distribution of X as Ber(^). The capacity 
can be achieved by a direct application of Theorem 6 in [1], 
if we identify U with Y\. A separate proof is provided here for 
completeness based on the theory of jointly strongly typical 
sequences [7]. 

We transmit at rate R over B — 1 blocks, each of length 
n. For the last block no message is transmitted. As B — > oc, 
R ( B ~ X } becomes arbitrarily close to R. 

Codebook Generation: Generate 2 nR independently and 
identically distributed n-sequences, X(w),w G {1...2 nR } 



where each element is generated i.i.d. ~ 11™= i p{xi), and 
p(xi) has the Ber(^) distribution. Fix a p(u\yi) such that it 
satisfies the constraint I(U; Yi) < R . Generate 2 nI{ - u ' Xl ' ) i.i.d 
rt-sequences, U(i), t G {1 . . . 2 n/ ( c/;Yl )} where each element 
is generated i.i.d. ~ \~[™=iP(ui). 

Encoding: We describe the encoding for block i. To send 
message Wi,Wi G {1...2 nR }, the transmitter simply sends 
X(u>i). The relay, having observed the entire corrupted noise 
sequence from the previous block Yi^-i, looks in its U 
codebook and finds a sequence U(i;) that is jointly strongly 
typical with Yi,j_i. It encodes and sends its index U across 
the private channel to the destination. Only the relay transmits 
to the destination in the last block B. 

Decoding: The destination, upon decoding t.- L , looks for a 
Wi^i such that X(wi_i) is jointly strongly typical with both 
Ufe), and Yj_x. 

Analysis of the Probability of Error: Because of the 
symmetry of the code construction we can perform the 
analysis assuming X(l) was sent over all the blocks. Since 
the decodings of different blocks are independent we can 
focus on the probability of error over the first block, and drop 
the time indices. The error events are: 



Ei : 


(X(1),Y, Yi) are not jointly strongly typical. 


E 2 : 


fit, (U(t),Yi) are jointly strongly typical. 


E3 : 


(X(l), Y, U(t)) are not jointly strongly typical. 


£4 : 


The destination makes an error decoding t in the 




next block. 


E 5 : 


3w ^ 1, (X(u;), Y, U(£)) are jointly strongly 




typical. 



For n sufficiently large we have P{Ei) < and 
P{E 2 n Ef) < ^g. By the Markov lemma [7, Lemma 
14.8.1], since (X(l), Y) - Y x - U(<) forms a Markov chain, 
H El n E$) < for n sufficiently large. Since by 
construction I(U;Yi) < Rq, the index t can be sent to the 
destination with an arbitrarily small probability of error so 
P(£ , 4) < Finally, the probability that another randomly 
generated X(w) is jointly strongly typical with both Y and 
U(£) is less than 2~ n ( I ( X ' Y - u * > ~ 1 \ Using the union bound, we 
have, P(E 5 n Di=i Ef) < 2 nR 2- n{I{X ' Y > u ^ . Thus, when 

R<I(X;Y,U), (4) 

we have P(E 5 n C\ i=1 Ef) < for sufficiently large n. 
Now, since X and U are independent, we have 

I(X;Y,U)=I(X;Y\U) (5) 
= H(Y\U) - H{Z\U) (6) 
= l-H{Z\U), (7) 

where H{Y\U) = 1, because for binary symmetric channels 
under the uniform input distribution Ber(^), the output Y is 
independent of the additive noise Z, and hence U. Collecting 
terms we see that P(Uj=i Ei) < -g, so that using the union 
bound again we can make the probability of error over all of 
the B blocks less than e as long as R < 1 — H(Z\U). 



B. Converse 

The converse will be easy once we prove the following 
lemma. 

Lemma 1: Let Z, V, N be independent Bernoulli random 
variables and let Y 1 = Z + V, Y = X + Z, and S = X 1 + N 
as shown in Fig. Q] The following inequality holds for any 
encoding scheme at the relay, 

H(Z n \S n ) > min nH(Z\U) (8) 

p(u\ Vl ):I(U;Yl)<Ro 

where the minimization on the right-hand side may be 
restricted to [Ts with \U\ < |^i| + 2. 

Proof: The proof of the lemma is closely based on the 
proof of [7, Theorem 14.8.1]. Fixing an encoding scheme at 
the relay, our strategy is to show that there always exists a U 
for which H(Z n \S n ) > nH(Z\U) and I(Y t ;U) < Rq. This 
would allows us to conclude that 

H(Z n \S n ) > min nH(Z\U). 

p(ii|»i):J(l/;Yi)<-Ro 

We start by finding a lower bound for H (Z n \S n ): 

n 

H(Z n \S n )=Y,n(Z i \S n ,Z 1 ,...,Z i _ 1 ) (9) 

i=l 
n 

> ^HiZilS* 1 ,^- 1 ,^- 1 ) (10) 

8=1 

n 

= Y J H{z l \s n ,vr 1 ) (id 

where in the third line we use the fact that Zi — S n Yl — 
S n Y^~ 1 Z 1 ^ 1 forms a Markov chain. The Markov chain fol- 
lows because Zi's are i.i.d., S n is only a function of Y{ 1 , and 
Zi can only be affected by Z 1 ^ 1 through S n . Now define 

Ui = (S^Yj 4-1 ), we get: 



H{Z n \S n )>Y,H(Zi\Ui). 



(12) 



i=l 



Next, note that Z — Y\ — X\ — S forms a Markov chain. As 
a result, 

I(X"; S n ) > I(Y™; S n ) (13) 

n 

= ^7(Y li ;5"|Y 11) ...,Y 1(i _ 1) ) (14) 



Y d I{Y li -S n ,Yt l ) 



(15) 



where in the third line we use the fact that Yu is independent 
of Y^ 1 and consequently /(YhjYj 4-1 ) = 0. Using our 
definition of U we get 



Recall that Rq = max I(Xi;S). Thus, we have shown the 

pOi) 

following inequalities: 

1 - 

Ro>-Y,I(y u ;Ui) (17) 

i=i 
n 

-H{Z n \S n )>-YH{Zi\Ui). (18) 
i=i 

Introducing a standard timesharing random variable Q, the 
above equations can be rewritten as 

1 - 

i2o > -Y^^Y^U^Q = i) = I(Y 1Q ;U Q \Q) 
i=i 

(19) 

n 

-H{Z n \S n ) > -J^HiZilUi, Q = i) = H(Z Q \U Q , Q) 

i=l 

(20) 

Now, since Q is independent of Y\q, we have 

I(Yiq; U Q \Q) = I(Y 1Q ;U Q , Q) - I(Y 1Q ;Q) = I(Y 1Q ; U Q , Q). 

(21) 

Finally, Y\q and Zq have the same joint distribution as Y\ 
and Z, so defining U = (Uq,Q), Z = Zq and, Y\ = Y\q, 
we have shown the existence of a random variable U such that 



Ro>I{Yx;U) 
H{Z n \S n ) > nH(Z\U) 



(22) 
(23) 



for any particular encoding scheme at the relay. Since for every 
possible encoding scheme at the relay we can construct an 
i.i.d. U satisfying the above equations, the minimum over all 
U's satisfying I(U; Y) < Rq must satisfy ([8]). The cardinality 
bound is the same as in [7, Theorem 14.8.1]. ■ 

The converse can now be proved in a straightforward 
manner with: 



nR = H(W) 

= I(W; Y™, S n ) + H(W\Y n , S n ) 

< I(W;Y n ,S n )+ne n 

< I(X n ;Y n ,S n )+ne n 

= I(X n ;Y n \S n )+ne n 

= H(Y n \S n )-H(Y n \S n ,X n ) + ne r , 



(16) 



< n- H(Z n \S n ,X n )+ne n 
= n-H(Z n \S n )+ne n 

(d) 

< max nil - H(Z\U)) + ne n (32) 

p(«|yi):/(;7;yi)<flo 



(24) 
(25) 

(26) 
(27) 

(28) 
(29) 

(30) 
(31) 



nC + ne r , 



(33) 



where 

(a) follows from Fano's inequality, 

(b) follows from the fact that X n is independent of S n , 

(c) follows from the fact that the maximum entropy 
of a binary random variable of length n is n, 

(d) follows from Lemma 1. 

Thus, we have shown that for any relaying scheme with a 
low probability of error, R < C. 

C. Comments on Theorem 1 

The capacity of the binary symmetric relay channel 
considered above is achieved essentially by digitizing the 
separate channel between the relay and destination. All 
that matters is that the capacity of the separate channel is 
sufficiently high to support the relay's description of U, the 
quantization variable. There is no advantage in joint source 
channel coding at the relay. The input codebook for X is 
drawn from the uniform Ber(^) distribution, identical to the 
capacity achieving distribution if the relay were absent; the 
source merely increases its rate once the relay is introduced. 

There are two conditions which are important for the 
converse to work. The channel between the source and 
destination should be additive and modular. These two 
conditions allow for two crucial simplifications in the 
converse. First, a uniform input distribution maximizes the 
output entropy, regardless of any information that the relay 
may convey about the noise; this was used in (f30b . Second, 
the linear nature of the channel, combined with the expansion 
in (129t . reduces the role of the relay to essentially source 
coding with a distortion metric being the conditional entropy 
of Z. This is in contrast to a general relay channel where 
the relay observes a combination of the source message and 
noise, so there is an opportunity for the destination to use its 
received signal to act as side information in the decoding of 
the relay's quantized message. For the binary symmetric relay 
channel, the uniform input distribution completely eliminates 
any aid the destination's output can provide in the decoding 
of the relay's message; this makes the converse easier to prove. 

D. Capacity Can be Below the Cut-set Bound 

To see that the capacity of Theorem 1 can be strictly below 
the cut-set bound, consider the case in which Z n has an i.i.d. 
£?er(i) distribution. The capacity can now be evaluated as 

C = l-h(hT 1 {l- Ra)*8), (34) 

where h(p) = —p\og 2 p — (1 — p) log 2 (l — p), and a * (3 — 
a(l— j3) + (l— a)/3. This capacity expression follows by noting 
that I(U;Yx) = H(Yi) - H(Yi\U), so that the constraint in 
the maximization of Theorem 1 can be rewritten as 

H{Y 1 \U) >H{Y 1 )-R . (35) 

Now we use Wyner and Ziv's version of the conditional 
entropy power inequality for binary random variables [8] to 
claim that if 

H{Yx\U)>a, (36) 



then 

H(Z\U) > h{h~ l {a) *S), (37) 

with equality if Y\ given U is a Ber(h~ 1 (a)) random variable. 
Wyner and Ziv's inequality holds because when Z is Ber(^) 
we can write Z = Y\ + V, where V is Ber(5) and Y\ and V 
are independent. 

Now, let a — H(Yi) — Rq. Observe that the U that achieves 
equality in d37l i. i.e., the U that gives rise to Y\ given U as 
Ber(h^ 1 (H(Yi) — Rq)), is precisely the U that minimizes 
the Hamming distortion of Y\ under a rate constraint Ro in 
standard rate-distortion theory. This is because rate-distortion 
theory states that for binary random variables, under a rate 
constraint Ro, the minimum achievable average distortion v 
must satisfy H{y) = i?(Yi|t/) = H(Y 1 ) - R and Yi given 
U must be Ber(y). Further, as Y\ is Ber(|), the distribution 
of the optimal U is also Ber(^). The capacity d34"i > follows 
by using this U in (01 and by substituting H(Yi) = 1 and 
a = 1 - Ro in dJ7Y 

We now show that the capacity as given in d34l ) is strictly 
below the cut-set bound. The cut-set bound equals [1] 

max mm{I(X,Xi;Y,S),I(X-,Y,S,Yi\Xi)}. (38) 

p(x.xx) 

When Z is Ber(^), we have 

I(X,Xi;Y,S) =H(Y,S)-H(Y,S\X,Xi) (39) 

<2-H(Z,N\X,Xi) (40) 

= 1 - H(Z) + 1 - H(N) (41) 

= Ro, (42) 

where the equality in PR)} is achieved by letting X and X\ 
have independent and identical Ber(^) distributions. 
Similarly, for the broadcast bound we have 

I(X;Y,S,Yi\X 1 )=I{X;Y\S,Y 1 ,X 1 ) (43) 

= H(Y\S, Y 1 ,X 1 ) - H(Z\S, Y X ,X, Xi) 

(44) 

<i-ir(v|s,yi,jr,^i) (45) 

= 1-H(8). (46) 

In the first line, we use the fact that X is independent of 
Y\ and S given X\. In the third line, we again use the fact 
that Yj = Z + V and since Z is Ber(^), so is Y\, thus 
Z = Y\ + V, and Y\ and V are independent. The equality in 
(|45| | is achieved again with X and X\ as independent Ber(^) 
distributed random variables. Since both d40b and d45t are 
achieved with equality with the same maximizing p(x, xx), we 
have shown that the cut-set bound for this particular channel 
is equal to 

min{i? , 1 - H(5)}. (47) 

The capacity given by (|34| > is strictly below the cut-set bound 
for all values of R > 1 - H(5). 
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Fig. 2. The modulo-sum Relay Channel 
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Fig. 3. Channel with rate limited state information to the decoder 



III. Extension to Modular Relay Channels 

We now extend the capacity results in Section [XT] to include 
the general modulo-sum relay channel depicted in Fig. [2] 
The source and the destination are related by a modulo-sum 
channel. The relay observes Y x , which is a correlated version 
of the noise Z with a conditional distribution p(yi\z). The 
relay also has a dedicated channel to the destination with a 
capacity 

R = max I(Xi;S). (48) 

p(zi) 

The binary symmetric relay channel considered in Section [TT] 
is a specific instance of the modulo-sum relay channel. The 
capacity proof for the binary case can be augmented to give 
the capacity of the modulo-sum relay channel. 

Theorem 2: The capacity of a modular and additive relay 
channel, in which the relay observes Yi, with p(yi\x, y, z) = 
p(yi\z), and the destination observes Y = X + Z mod m from 
the source and S from the relay through a separate channel 
with transition probabilities p{s\x\), is 

C= max m-H(Z\U) (49) 

f>H2/i):J(£/;Yi)<.Ro 

where the maximization may be restricted to J7's with 
\U\ < \yi\ + 2, and R is as defined in Bl . 

Achievability follows by applying a simple extension to the 
achievability proof of Theorem 1. The binary symmetric relay 
channel converse appropriately modified to reflect the different 
alphabet sizes remains valid. This is because all the necessary 
conditions for the converse to work are satisfied. The modulo- 
sum channel is linear, and the uniform distribution applied at 
the input maximizes the output channel entropy regardless of 
how much is known about the additive noise, so ( f30b holds. 

IV. Connection to Ahlswede-Han conjecture 

The Ahslwede-Han [6] conjecture states that for channels 
with rate limited state information to the decoder as shown in 
Fig. [3] the capacity is given by, 

C = max I(X;Y\S') (50) 

where the maximum is taken over all probability distributions 
of the form p(x)p(s')p(y\x, s')p(s'\s') such that 

I(S';S'\Y) <R 

and the auxiliary random variable S' has cardinality 

|5'|<|5'| + 1. 



For these channels, the output Y depends stochastically on 
both the input X and the particular channel state S'. The 
channel state is observed at another encoder that has a digital 
link to the destination with capacity Ro. The conjecture claims 
that the state variable S' should be quantized at rate R in 
such a way as to maximize the resulting mutual information 
between X and Y. By identifying S' with Yi, and S' with 
U, we observe that the class of relay channels described 
in Theorem 2 is a special case of the channel with rate 
limited state information to the decoder. We also note that the 
uniform distribution on X maximizes the capacity and makes 
Y independent of S', so that the rates achievable by (T50b and 
(|49l are identical thus confirming the conjecture for the class 
of channels described in this paper. 

V. Conclusion 

The capacity of a class of modular additive relay channels 
was found. The capacity was shown to be strictly below the 
cut-set bound and achievable using a quantize-and-forward 
scheme where quantization is performed with a new metric, 
the conditional entropy of the noise at the destination. This is 
the first example of a relay channel for which the capacity can 
be strictly below the cut-set bound. It was proved that there 
is no advantage to performing joint source channel coding of 
the relay's message over its dedicated link to the destination; 
digitizing the link is capacity achieving. The capacity derived 
here confirms a conjecture by Ahlswede and Han about the 
capacity of the rate limited channels with state information for 
this class of channels. 
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